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Abstract 

The tight span, or injective envelope, is an elegant and useful construction that 
takes a metric space and returns the smallest hyperconvex space into which it 
can be embedded. The concept has stimulated a large body of theory and has 
applications to metric classification and data visualisation. Here we introduce a 
generalisation of metrics, called diversities, and demonstrate that the rich theory 
associated to metric tight spans and hyperconvexity extends to a seemingly 
richer theory of diversity tight spans and hyperconvexity. 
Keywords: Tight span; Injective hull; Hyperconvex; Diversity; Metric 
geometry; 



1. Introduction 

Hyperconvex metric spaces were defined by Aronszajn and Panitchpakdi in 
PQ as part of a program to generalise the Hahn-Banach theorem to more general 
metric spaces (reviewed in [2J, and below). Isbell [3] and Dress [3] showed that 
every metric space could be embedded into a minimum hyperconvex space, 
called the tight span or injective envelope. Our aim is to show that the notion of 
hyperconvexity, the tight span, and much of the related theory can be extended 
beyond metrics to a class of multi-way metrics which we call diversities. 
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Recall that a metric space is a pair (X, d) where X is a set and d is a function 
from X x X to 5i satisfying 

(Ml) d(a, b) > and d(a, 6) = if and only if a = b. 
(M2) d(a,c) < d(a,b) + d(b,c). 

We define a diversity to be a pair (X, 5) where X is a set and 5 is a function 
from the finite subsets of X to 3? satisfying 

(Dl) 5(A) > 0, and 5(A) = if and only if \A\ < 1. 
(D2) If B ^ then <5(A U C) < 5(A U B) + S(B U C). 

We prove below that these axioms imply monotonicity: 

(D3) If A C B then 6(A) < 5(B). 

We will show that tight span theory adapts elegantly from metric spaces to 
diversities. The tight span of a metric space (X, d) is formed from the set of 
point-wise minimal functions / : X — > 3? such that f(a\) + /(12) > ^(01,^2) 
for all Q,i,a2 G X. Letting 'Pfin(X) denote the finite subsets of X, the tight 
span of a diversity (X, 5) is formed from the set of point-wise minimal functions 
/ : Vnn(X) ->■ 3? such that 

f(A,) + f(A 2 ) + ■■■ + f(A k ) > 5(A 1 Ui 2 U...U4) 

for all finite collections {A\, A2, . . . , Ak} Q ^^(X). We can embed a metric 
space in its tight span (with the appropriate metric on the tight span); we can 
embed a diversity in its tight span (with the appropriate diversity on the tight 
span). Both constructions have characterisations in terms of injective hulls, and 
both possess a rich mathematical structure. 

The motivation for exploring tight spans of diversities was the success of the 
metric tight span as a tool for classifying and visualising finite metrics, follow- 
ing the influential paper of Dress 0] . The construction provided the theoretical 
framework for split decomposition [S] and Neighbor-Net [B], both implemented 
in the SplitsTree package [7 and widely used for visualising phylogenetic data. 
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By looking at diversities, rather than metrics or distances, our hope is to incor- 
porate more information into the analysis and thereby improve inference [8]. 

Dress et al. [5] coined the term T-theory for the field of discrete mathemat- 
ics devoted to the combinatorics of the tight span and related constructions. 
Sturmfels [TU] highlighted T-theory as one area where problems from biology 
have led to substantial new ideas in mathematics. Contributions to T-theory 
include profound results on optimal graph realisations of metrics [H [TTJ IT2"] ; 
intriguing connections between the Buneman graph, the tight span and related 
constructions [9] [T21 - H6J : links with tropical geometry and hyperdeterminants 
[T71 [T5] ; classification of finite metrics [H [115] ; and properties of the tight span 
for special classes of metrics [2UJ [ST] . Hirai [22] describes an elegant geometric 
formulation of the tight span. We believe that there will be diversity analogues 
for many of these metric space results. 

Our use of the term diversity comes from the appearance of a special case 
of our definition in work on phylogenetic and ecological diversity [HI 
However diversities crop up in a broad range of contexts, for example: 

1. Diameter Diversity. Let (X, d) be a metric space. For all A E V^ n (X) let 

S(A) — max d(a,b) — diamM). 

Then (X, 5) is a diversity. 

2. L\ diversity. For all finite A C ffl 1 define 

5(A) = y^max{|ai - 6 4 | : a, b e A}. 

^ — ' a, b 

i 

Then , 5) is a diversity. 

3. Phylogenetic Diversity. Let T be a phylogenetic tree with taxon set X. 
For each finite A C X, let 5(A) denote the length of the smallest subtree 
of T connecting taxa in A. Then (X, 5) is a phylogenetic diversity. 

4. Length of the Steiner Tree. Let (X, d) be a metric space. For each finite 
A C X let 5(A) denote the minimum length of a Steiner tree connecting 
elements in A. Then (X, 5) is a diversity. 
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5. Truncated diversity. Let (X,5) be a diversity. For all A G T\ n (X) define 

6 {k) (A) = max{<5(B) : \B\ < k, B C A}. 

For each fc > 2, (X, £W) is a diversity. Note that these diversities can be 
encoded using OdXl' ) values, an important consideration for the design 
of efficient algorithms. 

The generalisation of metrics to more than two arguments has a long history. 
There is an extensive literature on 2-metrics (metrics taking three points as 
arguments); see |26j . Generalised metrics defined on n-tuples for arbitrary n go 
back at least to Menger [27] , who took the volume of an n-simplex in Euclidean 
space as the prototype. Recently various researchers have continued the study of 
such generalised metrics defined on rt-tuples; see [2"5H3"U] for examples. However, 
as of yet, a satisfactory theory of tight spans has not been developed for these 
generalisations. 

Dress and Terhalle |31j developed tight span theory for valuated matroids, 
which can be viewed as an n-dimensional version of a restricted class of met- 
rics. They demonstrated intriguing links with algebraic building theory. One 
significant difference is that, for diversities, the tight span consists of functions 
on Van(X) rather than on X, as is the case for valuated matroids. 

We note that our results differ from all of this earlier work because, for a 
diversity (X, 6) , the function S is defined on arbitrary finite subsets of X rather 
than tuples of a fixed length. 

The structure of this paper is as follows: In Section [2] we develop the basic 
theory of tight spans on diversities, defining the appropriate diversity for a tight 
span and showing that every diversity embeds into its tight span. In Section [3] 
we characterise diversities that are isomorphic to their tight spans. Here, iso- 
morphism is defined in analogy to isometry for metric spaces. These are the 
hyperconvex diversities, a direct analogue of metric hyperconvexity. We prove 
that diversity tight spans, like metric tight spans, are injective, and are formally 
the injective envelope in the category of diversities. In Section [4] we explore in 
more detail the direct links between diversity tight spans and metric tight spans. 
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We show when the diversity equals the diameter diversity (as defined above) the 
diversity tight span is isomorphic to the diameter of the metric tight span. In 
Section [5] we study the tight span of a phylogenetic diversity, and prove that 
taking the tight span of a phylogenetic diversity recovers the underlying tree 
in the same way that taking the tight span of an additive metric recovers its 
underlying tree. This theory is developed for real trees. Finally, in Section [6] we 
examine applications of the theory to the classical Steiner Tree problem. Dress 
and Kruger 32 defined an abstract Steiner tree where the internal nodes did not 
have to sit in the given metric space. They proved that these abstract Steiner 
trees can be embedded in the tight span. We extend their results to Steiner trees 
based on diversities, thereby obtaining tight bounds for the classical Steiner tree 
problem. 

2. The tight span of a diversity 

We begin by establishing some basic properties of diversities. Recall that 
V& n (X) denotes all the finite subsets of the set X and that a diversity is the 
pair (X,5) where X is a set and the function 6: V& n (X) — > 5ft satisfies axioms 
(Dl) and (D2). 

Proposition 2.1. Let (X, 6) be a diversity. 

1. // d: X x X — > 5ft is defined as d{x 1 y) — 5({x,y}) then (X,d) is a metric 
space. We say that (X, d) is the induced metric of {X, 6) . 

2. (D3) holds, that is, for A,B e V &Yl {X), if A C B then 5(A) < 6(B). 

3. For A,B e T &n (X) ifAnB^® then 5 (A U B) < 6(A) + 6(B). 

Proof. 

1. That d(x,y) = if and only if x = y follows from (Dl). Symmetry is 
clear. To obtain the triangle inequality, for any x, y, z G X, 

d(x, z) = 5({x, z}) < 6({x, y}) + 6({y, z}) = d(x, y) + d(y, z), 

using (D2). 
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2. First note for any a £ A and b G X that by (D2) with C empty 

6(A) < 6(A U {&}) + 5({b}) = S(A U {&}). 

The more general result follows by induction. 

3. Using (D2) 

5{A UB)< S(A U (An B)) + S(B U(An B)) = 5(A) + 8(B). 

a 

We now state the diversity analogue for the metric tight span. 

Definition 2.2. Let (X,d) be a diversity. Let Px denote the set of all functions 
f: Vfin(X) -> satisfying /(0) = and 

J2 f{A) > S( |J A) (2.1) 

for all finite C V Rn (X). Write f < g if f(A) < g(A) for all finite AC X. 
The tight span of (X, S) is the set Tx of functions in Px that are minimal under 

Example. Any diversity £onX = {l,2,3}is determined by the four values 

d 12 = 6({1,2}), d 23 = S({2,3}), di3 = 5({l,3}), d 123 = <5({l,2,3}). 

We write fi = /({^}), fa = f({i,j}) and / 123 - /({1,2,3}) for i,j G X. 
Condition (|2.1[) then translates to the following set of inequalities: 





h 


> 







fij 


> 


d i:j 
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+ fi 
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djj 




/l23 
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d\2Z 




" fjk 
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dl23 


h 


+ h 


> 


dl23 
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for distinct k € X. Note we have omitted inequalities like + fjk > (I123 



since these are implied by (2.2 1 and the triangle inequality (D2). The elements 



of Tx are the minimal / in Px- Equivalently, Tx is the set of / that satisfy 



(2.2 1 and such that for each nonempty A C X, Ja appears in an inequality in 
( [2~2| ) that is tight. 

A straightforward but tedious analysis of the inequalities (which we omit) 
gives the following characterisation of Tx ■ Define the three 'external' vertices 

v {1) = (0,d 12 ,d 13 ) 

= (d 12 ,0,d 23 ) 

= {d 13 ,d 23 ,0) 

and the four 'internal' vertices 



(0) 


= (^123 ^ 


- ^23, ^123 — di3, di2 3 — d\ 2 ) 


(1) 


= «(»)- 


03, 0,0) 


(2) 


= «<»)- 


(0,0,0) 


(3) 


= «(°)- 


(0,0,0), 



where /3 = max(2e?i23 — d\ 2 — d 2 $ — di3,0). Let C be the cell complex formed 
from the line segments [u^, [u^ 2 \v^], [u^jV^] and the solid tetrahe- 
dron with vertices . . . , u^ 4 ). A case-by-case analysis gives that / <E Tx if 
and only if (fx, f 2 , f 3 ) £ C, f 23 = max(d 2 3, ^123 - /1), /13 = max(di 3 , d 123 - h), 
f 23 = max(di2,di23 — f 3 ), /123 = di 23 . If P = then u^' to coincide, and 
the tight span is one-dimensional and resembles the metric tight span for the 
induced metric, albeit sitting in a higher dimensional space (Figure When 
/3 > the tight span resembles a tetrahedron with three spindles branching off, 
as in Figure [TJd. □ 

We now prove a characterisation of the diversity tight span which will be 



used extensively throughout the remainder of the paper (Theorem 2.3). An 
equivalent result holds for the metric tight span [4j Theorem 3(v)]. 
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1>2 



ru ,. ■ ■ , «3 




Figure 1: Two examples of the tight span on three points, with different values for <f({l, 2, 3}). 
On the left an example where 2d\2i < dl2 + ^23 + ^I3i and the diversity tight span is one- 
dimensional and resembles the tight span of the induced metric. On the right a case with 
2e£i23 > <ii2 + ^23 + ^13, where the diversity consists of a three-cell with three adjacent 
one-cells. 



Theorem 2.3. Let f: V& n (X) -> 5ft and suppose /(0) = 0. Then f G T x if and 
only if for all finite A C X , 



f(A)= sup [s(AU M B)- V /(B): |#|<ool 



(2.3) 



Proof. 



Suppose that f &T X - As /(0) = 0, the supremum in (2.3) is well defined. For 
all finite A C X and all finite SB C 'Pfin(^) we have 

/(A)>«(AU |J B)- £ /(B), 

BeJ Bess 

giving the required lower bound on f{A). Now suppose that for some finite Aq 



sup \s(A U |J B) - J2 f(B): \®\ < oo 1 



/(A)) > 



Define a function g : 7- > fi n (X) — > 5ft>o by 



9(A) = 



f{A) 



(2.4) 



if A £ A 



Clearly g ^ f and g ^ /. We show that (/ is in Pj. Let jz/ be a finite subset of 
Vtin(X). If A ^then 



£ ff (A) = £ /(A) > f( |J A). 
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If A £ srf then 

E 9(A) = sup h(A U \J B) f(B):M<™\+ E H B ) 

Aea? SSCVnn(X) [ Bf -ag BeSg J Bee/\{A } 

> 5(A U (J B) 
Be^\{A } 

= K U A )' 

Aestf 

by letting 38 = s$\ {A }. So g £ Px, g ^ f and g < f, contradicting / eT X - 



Hence there is no Aq satisfying (2.4). If / € Tx then (2.3) holds for all finite 
A CX. 



For the converse, suppose that (2.3) holds for all finite A C X. Then / £ 
Px- Suppose that g £ Px, that g di f and A £ V& n (X). Then for all finite 
@ C Pfm(^) we have 

^U[JB)-E ^ <H AU U B ) E ^ 9(A) 

Bess Bess Bess Bess 

so that f(A) < g(A). Hence / is minimal in Px- □ 



We note that the characterisation of tight spans given by Theorem |2.3| is 
analogous to the definition of tight spans for valuated matroids used by [31 j - 
One important difference is that, for diversities, the tight span is made up of 
functions on Va n (X) rather than functions on X. 

The following basic properties of members of Tx will be used subsequently. 

Proposition 2.4. Suppose that f £ Tx- 

1- f(A) > 5{A) for all finite A C X. 

2- If A C B C X and B is finite then f(A) < f(B); that is, f is monotone. 

3. f(A U C) < 5(A UB) + f(B U C) for all A,B,C £ V &n (X) with 5^0. 

4. /(A US) < f(A) + f(B) for all A,B £ V Rn (X); that is, f is sub-additive. 

5. f(A) = sup B {5(A U B) — f(B) : B £ V Rn {X)} for all finite A. 

Proof. 

1. Use = {A} in the definition of Px- 
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2. Follows from (2.3) and the monotonicity of 6. 



3. Let 4,B,Ce Pfm(X) with B ^ 0. We have 



/(AUG) = sup |<S(AUCU M £>)- J2 f(D): \9\ < oo 1 (2.5) 
f(BUC) = sup J*(BUCU M D)- V /(£>): |#| < oo 1.(2.6) 



Subtracting (2.6) from (2.5) gives 



f(AuC) -f(BUC) < sup \s(AuCU (J £>) -«(BUCU (J D): \2>\ < oo 
< (5(iUB), 

by taking 2) = and using the triangle inequality. We note that this property is 
analogous to the continuity of functions in the metric tight span, see [H Theorem 
3(iv)]. 

4. Given any A,BG Va n (X) and any finite collection c € C V^ n {X) we have 

f(A) + f(B) + f(C) > S(A U B U |J C) 

so that 

f(A) + f(B) > sup ls(AUBU U C)- ^ /(C): < oo 1 
= /(AUB) 

by Theorem |2.3| 

5. For any finite & C 7> fin , £ Be ^/(A) > fi^BeSsB). So 

sup i^U M B)- V /(B): |#| <ool = sup {S(A U C) - f(C)} . 
»e7W*) { B&m B&m J cev !in (x) 

a 



The distance between any two functions /, g in the metric tight span is given 
by the norm, 

d T {f,g) = sn V \f{x)-g{x)\ (2.7) 

xGX 
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which Dress [H Theorem 3(iii)] shows is equivalent on this set to 

d T (f,g)= sup {d(x,y)-f(x)-g(y)}. (2.8) 

x.yEX 

Dress also showed that a metric can be embedded into its tight span using the 
Kuratowski map k, which takes an element iGlto the function h x for which 
h x (y) = d(x,y) for all y. This is exactly the map e defined in [3J section 2.4]. 

Here we establish the analogous results for the diversity tight span. We 
define the appropriate function k from a diversity to its tight span. We then 
define a function St on Tx so that (Tx,6t) is a diversity and prove that n is 
an embedding. 

Definition 2.5. 1. Let (Yi,8i) and (12,^2) be two diversities. A map it: Y\ —J 

Y2 is an embedding if it is one-to-one (injective) and for all finite A C Y\ 
we have 81(A) = <J 2 (7r(.<4)). I n this case, we say that n embeds (Yi,S\) in 
(Y 2 ,S 2 ). 

2. An isomorphism is an onto (surjective) embedding between two diversities. 

3. Let (X, 5) be a diversity. For each x £ X define the function h x : 
TUX) ^Kby 

h x (A) = S(AU {x}) 

for all finite A C X . Let k be the map taking each x £ X to the corre- 
sponding function h x . 

4. Let (X, S) be a diversity. Let St '■ T > Sn(Tx) — > 3? be the function defined by 
5 T (0) = and 

5 T (F) = sup { S I M A ] - V inf f(A) : M < 00 |>(2.9) 



stfCV fiD (X) 



U A E inf /(A):K|<oo|(: 



for all finite non-empty F C Tx- 



Further manipulations give a form for St analogous to (2.8) 



S T (F)= sup 18 [\jA f - ^ : Ay e ^ fin (X) for all / e F 



for all finite F C T^finCTx). We can also re-express (2.9) in a form closer to (2.7) 



Note the similarity between Lemma 2.6 and [4j Theorem 3(iii 
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Lemma 2.6. If f e F then 



St(F) = sup 

a/CP, 



P \f I i J A I - Y ">f ff(^) : M < 



Proof. 

*r(F) = sup \s( |J iU |J B ) - ^ mf - ]T /(B) : K|, |0| < oo 

*,»C7%n(*) [ \ Ae ^ B< -^ J Ae ^9€F\{f} ^ 

sup |/([JA)-V inf 5 (A) : \st\ < oo I , 



by Theorem 2.3 □ 



Theorem 2.7. (Tx,<5t) «s a diversity. 
Proof. 

First note that for all F CT X , when ^ = {0}, 



so that 5t is non-negative. 

If ^ F C G then for all ,sz/ C 7> fin W with |.e/| < oo we have 

Ae#/ Ae.es? 
Hence 5t{F) < St(G), showing that 5t is monotone. 
If F = {/} then 

S T (F) = *r({/}) < sup {<$ (A) - /(A) : A e V &n (X)} = 



by the subadditivity of / and by part 1 of Proposition |2.4| On the other hand, 
if \F\ > 1 then there is /i,/a € F such that /i ^ /2- By monotonicity and 
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Lemma 12.61 we have 



St(F) > *r({/i,M) 



sup 

sup {h(A)-h{A)} 
AeVn n (X) 



< oo 



> 0. 



We have now proved that St satisfies (Dl). 

For the triangle inequality, suppose F and G are disjoint finite subsets of Tx 



and that h G T x \ (F U G). Then by Lemma 2.6 



5 T (FU{h}) = sup \h I \J A) - ]T ™lf( A ) ■■ M < ^(P ) 



and 



S T (GU{h}) 



sup \h( \J b)-J2 inf 5(5) : |^| < oo kll) 

*ct%»(x) [ Vse® / Bess 9 ^ J 



By part 4 of Proposition 2.4 the function h is sub-additive, so 



Vag^ / \BeSS J J 



(2.12) 



Combining (2. 10) -(2. 12) and again applying Lemma 2.6 we have 



5 T (FU{h}) + S T (GU{h}) > sup \hl \J C)-J2 M f(C):\C\ 



< oo 



6 T (FUGU{h}). 



The triangle inequality (D2) now follows by monotonicity. 



□ 



Theorem 2.7 establishes that (Tx,St) is a diversity. We now show that n is 



an embedding from (X, S) into (Tx,St)- We then prove the diversity analogue 
of [31 Eq. (2.4)] (see [H Theorem 3(h)]) and characterise 5t in terms of a 
minimality condition. 
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Theorem 2.8. 1. The map n is an embedding from (X,5) into (Tx,St)- 

2. For all finite Y C X and f G Tx, 

6 T ( K (Y)U{f}) = f(Y). 

3. // (Tx,S) is a diversity such that S(k(Y) U {/}) = f(Y) for all finite 
Y C X and f G Tx then 

5{F) > S T (F) 

for all finite F C Tx ■ 
Proof. 

1. Fix x <E X. Consider finite si C V& n {X). The triangle inequality for 
diversities, (D2), gives 

£ M^)= E *(AU {«})>*( (J A] , 

so that /la € Px- There is g € such that g <h x . Since /^({x}) = = 

we have for all finite A C X that 

h x (A) = S(Au{x}) < g(A)+g({x}) < g(A)+h x ({x}) = g(A) < h x (A). 

Hence h x = g G Tx ■ 

To see that k is one-to-one observe that for x ^ y, ^({x}) = but /ij,({a;}) = 
5({x,y}) > 0. So /i x ^ h y for distinct a;, y G X. 

We now show that <5 t (k(T)) = 5(F) for all finite F C X. Let 7 C X, 



Y = {y 1 ,.. .,y k }. Taking si = {{ Vl }, {y k }} in (2.9| gives S t (k(Y)) > S(Y) 



By repeatedly using the triangle inequality we have for any finite 
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{Ai,A 2 , . . . , Aj} C Vfin(X) and Zi, . . . , Zj € Y that 

S(Y) > S(YUA 1 )-5({z 1 }UA 1 ) 

> 5{Y U A 1 U A 2 ) - 5{{z x } U A x ) - <5({z 2 } U A 2 ) 



i=l / i=l 
3 \ 3 



Vi=l / i=l 
^ 3 \ 3 



Taking the supremum over all such srf and applying (2.9 1 gives 5j>(k(Y)) < 5(Y). 
So 5t(k(Y)) = S(Y) and k is an embedding. 

2. Let Y C X, Y finite, and / e Tx- If / = h y for y S y then, using part 1, 

<5 T («(r) U {/}) = S T (n(Y)) = 6(Y) = J(Y U {y}) = f(Y), 

as required. Otherwise, suppose / «(Y). Let Y = {yi, . . . , y^}. 

S t (k(Y) U {/}) = ^ jup |<J MJ A, U Ay J - £ U Aj) - /(Ay) | . 

Letting Aj = {yi} for all i shows 

S t (k(Y) U {/}) > sup{<5(r U Ay) - /(Ay)} = f(Y), 
Af 



by Proposition |2.4| part 5. On the other hand, following the same reasoning as 
in part 1 of this proof shows 

6 t (k(Y) U {/}) < sup<5(Y U Ay) - /(Ay) = f(Y). 

As 

Therefore S t (k(Y) U {/}) = f(Y). 

3. Suppose that F = n(Y) U G, where Y G ^(X) and G C T x \ k(X). 
For all collections srf C ■Pfin(X) with |,sz/| < oo and all collections {fA}Ae<tf of 
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elements in F, we have from 1. and 2. that 



5[YU |J A) ^(A) = sL(Y)U (J k(A))- £ U {&}) 



< 5( K (Y)U{f A :Ae^}) 

< S(k(Y)UF). 



a 



3. Hyperconvex diversities and the injective envelope 

Aronszajn and Panitchpakdi pQ introduced hyperconvex metric spaces and 
showed that they are exactly the injective metric spaces. 

Definition 3.1. 1. A metric space (X,d) is said to be hyperconvex if for 
all r: X — > with r(x) + r(y) > d(x, y) for all x,y G X there is a point 
z £ X such that d(z, x) < r{x) for all x € X . 
2. A metric space (X, d) is injective if it satisfies the following property: given 
any pair of metric spaces (Yi, d\), (Yi, ^2); in embedding n : Yi Y<x and 
a non-expansive map <f> : Y± — > X there is a non-expansive map ip : Yi X 
such that 4> — tp o 7r. 

See [2] for a proof of the equivalence of these two concepts, as well as a highly 
readable and comprehensive review of the rich metric structure of hyperconvex 
spaces. Here we establish diversity analogues for these concepts and show that 
the equivalence holds in this new setting. We begin by defining diversity ana- 
logues of injective and hyperconvex metric spaces. 

Definition 3.2. 1. Given diversities (Yi, 8±) and (Y2, 62), a map (ft : Y± Y2 
is non-expansive if for all A C Y\ we have 5%{A) > <5 2 (</>(.A)) and it is 
an embedding if it is one-to-one and for all A C Y\ we have Si(A) = 
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2. A diversity (X, S) is injective if it satisfies the following property: given 
any pair of diversities (Y±,6i), (^2,(^2); a,n embedding 7T : li — > Yi and a 
non-expansive map cf> : Y\ X there is a non-expansive map ip : Y% X 
such that (f> = ip o 7r. 

3. A diversity (X,S) is said to be hyperconvex if for all r: Vn n (X) — > 5ft such 
that 

S 



U^)<E <A) (3.1) 



for all finite C V^iX) there is z £ X such that S({z} U Y) < r(Y) for 
all finite Y Q X . 

The following theorem establishes the diversity equivalent of Aronszajn and 
Panitchpakdi's result. 

Theorem 3.3. A diversity (X,8) is injective if and only if it is hyperconvex. 
Proof. 



First suppose that (X, S) is injective. Consider r: Vfi n (X) — > 5ft satisfying (3.1| 
for all finite £/ C VR n (X). Without loss of generality we can assume r(0) = 
and hence r G Px- Choose / £ Tx with / <r. 

Let x* be a point not mX, let X* = XU{x*} and let S* : P Sd (IU{i*}) -> 5R 
be the function where for all finite A C X, 

5* (A) = 5(A) 
5*(A(J{x*}) = f(A). 

From Proposition |2 ,4| part 2 we have that 5* is monotonic, and from part 4 and 
5 we have that 

5*{AuCU{x*}) < 6*(AU{x*}) + 5*(CU{x*}) (3.2) 
6*(AuBUCU{x*}) < S*(AuBU{x*}) + 6*(BUC). (3.3) 

for all finite A, B,C C X such that B 7^ 0. These, together with monotonicity 
and the fact that 5* coincides with S on Vfi n (X), imply the triangle inequality 
(D2) for (X*,S*). 
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We now apply the fact that (X, 6) is injective. Let (Y\,5i) be (X, 5); let 
(Y 2 ,5 2 ) be (X*,S*), let w be the identity embedding from (X, 5) into (X*,5*) 
and let <fi be the identity map from (X, S) to itself. Then there is a non-expansive 
map <f> : X* — > X such that 0(x) = a; for all x e X. 

Let w = <j>{x*). For all finite ^4 C X we have 



This proves that (X, 5) is hyperconvex. 

For the converse, suppose now that (X, 5) is hyperconvex. Let (Yi,£i) and 
(Y 2 ,5 2 ) be two diversities, let ir : Y\ — > Y2 be an embedding and let be a 
non-expansive map from Yi to X. We will show that there is non-expansive 
ip : Y 2 — > X such that </> = -0 o 7r. 

Let ^ denote the collection of pairs (y V'y) such that 7r(Fi) C F C Y 2 and 
V>y is a non-expansive map from Y to X such that = ipy We want to 
show that Y2 € Suppose this is not the case. We write (Y,iPy) <! {Z,ipz) 
if y C Z and ^ restricted to V equals ^y- The partially ordered set (^, <) 
satisfies the conditions of Zorn's lemma, so it contains maximal elements. 

Let (Y, ipy) be one such maximal element. Choose y e Y 2 \Y. For each 
finite A C y let r{A) = S 2 (A U {y}). For any finite collection ^ C Va n (Y) we 



5(4 UM) < 5*(AU{x*}) 



!(A) 



< r{A). 



have 




< ^« 2 (iU{|,}) 
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Since (X, 5) is hyperconvex, there isiel such that 



Clearly, 



6(<p Y (A) U {x}) < r(A) = S 2 (A U {y}) 

for all finite AQY . Hence we can extend to FU {y} by setting tpy(y) = x, 
giving a non-expansive map from YL){y} to X, and contradicting the maximality 
of Y. 

It follows that Yz € <3f , proving that (X, 6) is injective. □ 
Definition 3.4. Let (X, 6) be a diversity. For F C Tx and finite Y C X let 

♦'0-> " -d£ m { £ : M < °°' U/ " y } ■ 

5 T {F) = sup {S(Y) - $ F (Y) : |Y| < oo}. (3.4) 
We show that $f also satisfies a sub-additivity type identity. 
Lemma 3.5. For F,GCT X and Y, Z C P fin (X) we /wrae 
$ Fu g(^UZ) <$ F (F) + $ G (Z). 

Proof. 

Given e > there is finite srf C 7 7 fi n (X) and a collection of elements 

in Tjc such that 

*f{Y) < Y,f A (A) < $ F (F) + e/2. 

Ae&f 

Similarly, there is finite &§ C 7 3 fi n (^) and a collection {gs^Bess of elements in 
Tx such that 

*g{Z) < ^<?b(#) < $ G (Z)+e/2. 

Bess 

Define — si ' U ^ and the collection {hc}cetf by 

/c if C G ^; 
5c otherwise. 
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h c = < 



Then 

* F (Y) + * a (Z)+e > M A )+ E 9b{B) 

Ae#/ Bess 

> Em c ) 

> <S> FUG (YUZ). 

Taking e — > proves the Lemma. □ 

Isbell proved that the metric tight span is injective, and hence hyperconvex 
O Section 2.9]. Here we prove the same result for diversities. 

Theorem 3.6. For any diversity (X,5), the tight span (Tx,6t) is hyperconvex. 



Proof. 

Let r : V Ru (T x ) -> 3? be given such that for all finite & C V Rn (T x ) 



E K* 1 ) > <*r ( LM 



Without loss of generality we can assume r(0) = 0. We need to find g E Tx so 
that 5 T (G U { 5 }) < r(G) for all G C Tx- 
Define u on ■Pfi n (X) by 

w(A) = inf {r(F) + $ F CA) : |F| < oo}. 

We have w(0) = 0. Suppose that si C ^(X), K| < oo and let {Fa ■ A e 
&/} be a collection of finite subsets of Tx indexed by elements of From 



Lemma 3.5 and (3.4 1 we have 



s(\Ja) < sJ]Jf a ) + ^ Fa) (\Ja 

< ew^) + $f a (a)), 



AGs 

so that 
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and u! G Px ■ 

There is g G Tx such that g -< u. Consider finite F C Tx- Applying 
Lemma |2.6| 

5 T (FU{ ff }) = sup {g(A)- ® F (A)} 

A£P tin (X) 

< sup {(r(F) + <S> F {A))-$ F (A)} 
= r(F), 

as required. □ 



The metric tight span construction gives an isometric embedding k from a 
metric space (X, d) into an injective (hyperconvex) metric space. Isbell showed 
that this embedding is minimal in that no proper subspace of the tight span 
both contains n(X) and is injective. Such an embedding is called an injective 
envelope, and all injective envelopes of a metric space are equivalent [3J Thm 
2.1]. 

Here we prove the analogous result for diversities that the embedding n of 
a diversity into its tight span is also an injective envelope. 

The class of all diversities with all non-expansive maps as morphisms forms a 
category, which we will denote Dvy. The definitions of embeddings and injective 
objects then correspond to concepts in category theory, as reviewed in [33j . 



Lemma 3.7 together with the injectivity of (Tx, St) establishes that (Tx,St) is 
the injective hulloi (X, S) in the category Dvy (33J pg. 156]. Proposition 9.20(5) 
of [33] demonstrates the equivalence between the category theory injective hull 
and the injective envelope introduced in [3]. 

Lemma 3.7. Let cj> be a non-expanding map from (Tx, St) to diversity (Y, Sy)- 
If tt = 4>o k is an embedding from (X,S) to (Y,Sy) then (f> is an embedding from 
(T X ,S T ) to (Y,S Y ). 

Proof. 

Since 4> is non-expanding St(F) > 6y(4>(F)) for all finite F C Tx- Using 
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Theorem 2.8 part 3 we will show that 5t{F) < 6 Y (4>(F)), so that </> is an 
embedding. 

Consider / G T x . Define g on V Rn {X) by g{A) = 5 Y (ir(A) u <K{/})) for a11 
finite A. Then for any finite ACIwe have 

g(A) = 5 Y (ir(A) U #{/})) = <5y(0(«(A) U {/})) < «5 t (k(A) U {/}) = /(A) 

for all A. For all finite collections C Pg^X) we have 

2 = ^ M^4)U </>({/})) 



> S Y ( |J tt(A) j 



so that g e Px and # ^ /. Hence g(A) = /(A) for all finite AC X. It follows 
that 

5 Y (n(A)um})) = 5MA)U{f}) 

for all / G T x and finite ACI 

Define 6 on T x by 5(F) = 5 Y ((f>(F)). Then ? is a diversity and ?(/t(Y) U 



{/}) = f(Y) for a11 nni t e Y C X. By Theorem |2.8| 5(F) > S T (F) for all finite 
F. 

As <j) is non-expansive 6t(F) = S Y (4>(F)) for all finite F and </> is an embed- 
ding. □ 



The following Theorem is a translation of [33J Proposition 9.20(4)] to diver- 
sities. 

Theorem 3.8. If there is an embedding n from (X, 5) into (Y, S Y ) and (Y, 5 Y ) is 

injective (hyperconvex) then there is an embedding <fi from (Tx,8t) into (Y,S Y ) 
such that 7r = <f> o k. 

Proof. 

Since 7r is a non-expansive map, (Y,S Y ) is injective, and n is an embedding 
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of (X,S) into (Tx,St), there is a non-expansive map <f> : Tx — > Y such that 
7r = (j) o k. By Lemma 3.7 <j) is an embedding. □ 



Corollary 3.9. Let (X, 5) be a diversity. The following are equivalent: 

1. {X, S) is hyperconvex; 

2. (X, 5) is injective; 

3. There is an isomorphism between (X,S) and its tight span, (Tx,St)- 
Proof. 



1. and 2. are equivalent by Theorem 3.3 To see that 2. implies 3., let (Y, Sy) 



(X, 5) and n = id in Theorem 3.8 Then there is an embedding from (Tx, St) 
to (X, 5) such that <f> o k — id. So k is surjective and 3. follows. Finally, since 
hyperconvexity is invariant under isomorphism, 1. follows from 3. □ 



4. Tight span of the diameter diversity 

In this section we prove that tight span theory for metrics is embedded within 
the tight span theory for diversities. The link between the two is provided by 
the diameter diversity as introduced above. 

Definition 4.1. Given a metric space (X,d) we define the function 6 — diam^ 
by 

6(A) = diam^yl) = max{(i(a, a') : a, a' € ^4} 

for finite A Q X , with dianid^) = 0. We call (X, diam^) the diameter diversity 
for (X, d) . Note that if we restrict diam^ to pairs of elements we recover d as 
the induced metric. 

We will establish close links between tight spans of metrics and tight spans 
of their diameter diversities. 
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(X,d) (Ij, (It) 



S— diam d 



S T — diam d 



(X,S) 



tight spa 



(T S X ,5 T ) 



Lemma 4.2. 1. Let (Y,S) be a diversity with induced metric (Y,d$). Let 
(X, d) be a metric space and let (X, diani^) be the associated diameter 
diversity. Then 4> is a non-expansive map from (Y, 5) to (X, diam^) if and 
only if it is a non-expansive map from (Y,d$) to (X,d). 

2. A metric space (X,d) is infective (hyperconvex) if and only if the diameter 
diversity {X, diam^) is infective (hyperconvex). 

3. The tight span (T^, St) of a diameter diversity is itself a diameter diver- 
sity. 

Proof. 

1. Suppose that is a non-expansive map from (Y, S) to (X, diam^). For all 
2/i , 2/2 € Y we have 

^(2/1,2/2) = £({2/1,2/2}) > diam d ({0(yi), 0(y 2 )}) = d(<j>{y{), 4>{y 2 )), 

so <p is non-expansive from (Y, ds) to (X, d). Conversely, suppose is a non- 
expansive map from (Y, ds) to (X, d). Then for any finite A C Y we have 

5(A) > sup{Gfo(ai, a 2 ) : ai, a 2 G A} 

> sup{rf(0(ai), 0(a2)) : 01,02 € A} 
= diam d ( ( /)( J 4)). 

2. Suppose that (X, d) is injective. Let (Yi,£i), (^2,^2) be two diversities with 
induced metrics d\,d2- Let 7r be an embedding from (Yi,Si) into (^2,^2) and 
let (!) be a non-expansive map from (Y\,Si) to (X, diam^). Then ir embeds 
(Yi,di) into (Y2, d 2 ), and by part 1., <j> is a non-expansive map from (Yi,di) to 
(X, d). As (X, d) is an injective metric space there is a non-expansive map if 
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from (Y 2l d 2 ) to (X, d) such that 4> — i> n , which by part 1. is a non-expansive 
map from (Y^diam^) to (X, dianid). Since 82(A) > diam £ ; 2 (A) for all A, tp is 
non-expansive from (Y 2 ,<5 2 ) to (X, cham^). Hence (X, diam d ) is injective. 

Conversely, suppose (X, diam^) is an injective diversity. Let (Yi, di), (Y 2 , d 2 ) 
be two metric spaces, let tt be an embedding of (Y"i,di) into (Y 2 ,d 2 ), and let 
<f> be a non-expansive map from (Y\,d\) to (X, <i). Then <j> is a non-expansive 
map from ( Yi , diaru^ ) to (X, diam d ) and since (X, diam d ) is injective, there is 
a non-expansive map tp from (Y2, diam<i 2 ) to (X, diany) such that <j> = -0 o tt. 
Applying part 1. again, we have that tp is the required non-expansive map from 
(Y 2 ,d 2 ) to (X, d). Hence (X, d) is injective. 

3. Since (X,8) is a diameter diversity, for any finite F and {Af}f G p C V^ n (X), 
we have 

for some 

fi, h G ^- Hence for finite F C T& 



= max sup {<5(i4iUi4 2 )-/i(Ai)-/ 2 (^2)} 

/i,/2e-F J 4i,A 2 e'p fin (x) 

= max ^t({/i,/ 2 })- 

□ 



Theorem 4.3. Lei (X, rf) 6e a metric space with metric tight span (T%, dr)- Let 
(X, 8) be the associated diameter diversity where 8 = diam^, and let (T^, St) be 
its diversity tight span. Then 

1. The metric space obtained by restricting St to pairs in T^ is isometric to 
the metric space (T^,dr)- 

2. The diversity obtained by taking the diameter on the metric space (T%, dr) 
is isomorphic to the diversity (Tx,St)- 
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Proof. 

First note that for any metric spaces (Xi, d\) and (X 2 , d 2 ) a map <p from X\ to 
X 2 is an embedding from (Xi, di) to {X 2 , d 2 ) if and only if <fi is an embedding 
from (X\ , diam ( i 1 ) to {X 2 , diam^j ) . 

Let (Ty , ) be the diameter diversity associated to (Tj| , c?t) and let (T£ , ds T ) 
be the induced metric for (Tj^, <5t). Let /c<j be the Kuratowski embedding from 
(X,d) to (T^jdr)- Then is also an embedding from (X, 6) to (T^,Sd T ). In 
the same way, let K5 be the Kuratowski embedding from (X, 6) to (T x ,5t)- 
Then k$ is also an embedding from (X, d) to (Tj^, d<5 T ). 



By Lemma 4.2 2., (T|,^ T ) is a hyperconvex diversity and (T x , dg T ) is a 
hyperconvex metric space. Applying [33, Proposition 9.20(4))] in the category 
Met there is an embedding <j> from (T^,g?t) to (T^,ds T ) such that 

K$ = (j)0 K d . (4.1) 

The identity map id T ^ on (T^, dr) is non-expansive and <j> is an embedding, 
so applying the definition of injective metric spaces to (T^,c?t) we have that 
there is a non-expansive map tp from T$- to T% such that 

ipo(f) = id T d . (4.2) 



By Lemma 4.2 3., the diversity (T^St) is a diameter diversity and so from 
1., the map ij) is also a non-expansive map from (T_y, i5t) to (T^, 8d T )- 



Lemma 



4.2 



Combining (4.1) and (4.2) we have 



which is an embedding. By Lemma |3.7| we have that tp is an embedding, im- 
plying that that <fi is both an isomorphism from (T^dr) to (T^,dg T ) and an 
isomorphism from (T^,5d T ) to (T^,5t)- □ 
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5. Phylogenetic diversity 

A metric space {X, d) is additive or tree-like if there is a tree with nodes 
partially labelled by X so that for each x, y £ X the length of the path (including 
branch- lengths) connecting x and y equals d(x, y). Dress [5] showed that if (X, d) 
is additive then its metric tight span corresponds exactly to the smallest tree 
it can be embedded in. The elements of the tight span correspond not only to 
the nodes of the original tree, but also the points along the edges. Here we will 
prove analogous results about phylogenetic diversity. 

Following [4] we will work with real trees (also called metric-trees or -R-trees), 
rather than graph-theoretic trees. 

Definition 5.1. \5Q 

1. Let (X ', d) be a metric space and let x,y be two points at distance d{x, y) = 
r. A geodesic joining x,y is a map c : [0,r] — > X such that c(0) = x, 
c(r) = y and d(c(s),c(t)) = \t — s\ for all s,t £ [0, r]. The image of c is 
called a geodesic segment. 

2. \34[ Defn 2.1] A metric space (X, d) is a real tree or 3t-tree if 

(a) there is a unique geodesic segment [x, y] joining each pair of points 
x, y £ X . 

(b) if [y, x] H [x, z] = {x} then [y, x] U [x, z] = [y, z] . 
Hence ifx,y,z are three points in a real tree then 



Phylogenetic diversity, as introduced by [23] and investigated extensively by 

[SI [HI [5S] and others, can be viewed as a generalisation of additive metrics. 
The phylogenetic diversity of a set of nodes or points in a tree is the length of 
the smallest subtree connecting them, so that the restriction of a phylogenetic 
diversity to pairs of points gives an additive metric. A formal definition of 
phylogenetic diversity on real trees requires a bit more machinery. 

For a real tree (X,d), let /i be the one-dimensional Hausdorff measure on it 
|36j . The important features of fj, for our purposes is that it is defined on all 




[x,y] C [x,z] U [y,z\. 



(5.1) 
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Borel sets, it is monotone, and it is additive on disjoint sets. Furthermore, for 
any points a, b £ X, A*([<2j b]) = d(a, 6), and naturally /x({a}) = 0. See |37| for a 
related measure on real trees. 

Definition 5.2. 1. The convex hull of a set A C X is 

conv(A) = (J [a, b] 

a,b£A 

and we say that A is convex if A = conv(A). 
2. Let (X, d) be a real tree. The real-tree diversity (X , 5 t ) for (X , d) is defined 
by 

S t (A) := ^(conv(A)) 

/or aZZ finite A C Af. JVoie i/ia£ since A is finite, conv(A) is closed and 
hence /x(conv(j4)) is defined. 

First we prove that this phylogenetic diversity satisfies the diversity axioms 
(Dl) and (D2). 

Theorem 5.3. Let (X,d) be a real tree. Then (X, St) is a diversity. 
Proof. 

Since \i is a measure, 5 t is non-negative and also monotonic. If \A\ < 1 then 
conv(A) = A and so S t (A) = fi(A) = 0. If \A\ > 1 then select distinct a,b e A. 
Since conv([a, b]) — [a,b] and /j,([a,b]) = d(a,b) we have 5* (A) > i5t({a, 6}) = 
d(a,6) > 0. This proves (Dl). 



Let A,B,CG Vs n (X) and suppose that B ^ 0. From (5.1) we have 

[a, c] C [a, b] U [6, c] (5.2) 
for all a £ A, b G £> and c G C. Hence 

conv(A U C) C conv(A UB)U conv(S U C) 

and 

Jt(4UC) = p(conv(iUC)) 

< /i(conv(A U £)) + //(conv(S U C)) 
= 5 t (AuB) + 5 t (BUC), 
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giving us the triangle equality (D2). 



□ 



We now show that complete real-tree diversities are hyperconvex, proving 
the diversity analogue of [311 Theorem 3.2]. 

Lemma 5.4. Let (X ', d) be a real tree with associated tree diversity (X , St). For 
all finite C C X and r > 5 t {C), the ball B(C,r) = {x G X : S t (CU {x}) < r} is 
closed and convex. 

Proof. 

For any finite but non-empty C C X the function 

<j> : X ^ M : x ^ 8 t (C U {x}) 
is continuous. Hence when r > S t (C) the ball 

B(C,r) := ^(A) = {x e T d x : 8 t (C U {x}) < r} 

is closed. 

To prove convexity, suppose that x±,X2 G B(C,r). Fix a G C. For all 
y G [a,xi], conv(CU {y}) C conv(C U {xj) and so (5 ( (CU{)/}) < iS t (CU{a;i}) 
showing that y G B(C,r). We have that [a, xj.], and by symmetry [a, X2], are 



contained in B{C,r). By (5.1) we have 

[xi, x 2 ] C [a, xi] U [a, x 2 ] C B(C, r) 
so that B(C,r) is both closed and convex. □ 



Theorem 5.5. Let (X , d) 6e a real tree with associated real-tree diversity (X, St). 
Then (X,St) is hyperconvex if and only if (X,d) is complete. 

Proof. 
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Suppose that (X, d) is a complete real tree. Then (X,d) is a hyperconvex 
metric space [38:, Theorem 3.2]. Suppose that r : Ps n (X) — > 3? satisfies 



has a non-empty intersection. 

First we show that the members of T intersect pairwise. Consider a pair of 
nonempty finite subsets Ai, Aj of X. To show that B(Ai, r(Ai)) and B(Aj,r(Aj)) 
intersect, we show that there is v such that 5 t {Ai U {v}) < r(Ai) and 5 t {Aj U 
{v}) < r(Aj). This clearly holds if there is conv(Aj) D conv(A,) ^ 0. Suppose 
then that conv(Ai) and conv(Aj) are disjoint. Since Ai,Aj are finite, conv(Ai) 
and conv(Aj) are closed subtrees of Tj£. By [351 Ch. 2, Lemma 1.9] there ex- 
ists ai € conv(Ai) and aj S conv(A,) such that [ai,aj] l~l conv(Ai) = {ai} and 
[ai, aj] nconv(Aj) = {aj} and for all x £ Ai and y G Aj we have [ai, aj] C [x, y]. 
Then, 



r(Ai) + r(A,-) > 5 t {Ai\JAj) 

= /j(conv(A, U A 3 )) 

> /i(conv(Ai)) + ^([ai,aj]) + /i(conv(A,)) 

= ^ t (A i ) + d(a i ,a i ) + (5 t (A j ). 

Hence there is v € [a;,Oj] such that d{a,i,v) < J'(Ai) — <5t(Ai) and d(aj,v) < 
r(A,-) - <5 t (A,-), so that 




for all finite C VtiniX). We will show that the collection of balls 



T = {B(A,r(A)):AeV tin (X)} 



S t (AiU{v}) 



5t(Ai) + S t ({ai,v}) 



S(Ai)+d{ai,v) 



and likewise 8t{Aj U {v}) < r(Aj). 
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We have established that T satisfies the pairwise intersection property. The 
closed, convex sets of a real tree satisfy the Helly property 34 , so every finite 
subcollection of T has non-empty intersection. By the completeness of (X, d), 
r has a non-empty intersection, so there is v such that St(A U {«}) < r(A) for 
all A £ Vnn(X). This proves that (X, St) is hyperconvex. 

For the converse, we note that completeness of (X,d) follows directly from 
[H Proposition 3,2] and the definition of hyperconvexity for diversities. □ 



Definition 5.6. A diversity (X,S) is a phylogenetic diversity if it can be em- 
bedded in a real-tree diversity {X, St) for some complete real tree (X,d). 

Clearly, every real-tree diversity is a phylogenetic diversity, but a phyloge- 
netic diversity is a real-tree diversity only if its induced metric is a real tree. 

Theorem 5.7. Let (X, 8) be a diversity. Then (X, 5) is a phylogenetic diversity 
if and only if (Tx, St) is a real-tree diversity. 

Proof. 

Since (X, S) is a phylogenetic diversity there is a complete real tree (X, d) with 
real-tree diversity (X, St) for which there is an embedding (f> from (X,S) into 



(X,5 t ). By Theorem 5.5 {X,S t ) is hyperconvex. By Theorem |3.8| there is an 
embedding ip from (Tx , St) into (X, St) such that (j) = ip o k. 

Let (Tx,dg T ) be the induced metric for (Tx,St)- It follows directly from 
the hyperconvexity of (Tx,St) that (Tx,ds T ) is convex. For any /, g € Tx and 
geodesic segment [/, g] in Tx , the image of [/, g] under ip is the unique geodesic 
segment between an d ip(g)- It follows that 4>{Tx) is a convex subset 

of (X,d) and (X,d) restricted to tp(Tx) is a real tree jSHl pg. 36]. Restricting 
(X, St) to ip(Tx) then gives a real-tree diversity which is isomorphic to (Tx, St)- 

For the converse, note that the map n from (X, S) into its tight span is an 
embedding, so that (X, S) is a phylogenetic diversity. □ 



We now link the real tree given by the diversity tight span of a phylogenetic 
diversity and the tight span of its induced metric. 



31 



Lemma 5.8. Let (X, dx) and (Y, dy) be complete real trees and let (X, 5x) and 
(Y,Sy) be the associated real-tree diversities. Then 

1. ip : X — > Y is a non-expansive map from {X, dx) to (Y, dy) if and only if 
it is a non-expansive map from (X,Sx) to (Y,Sy). 

2. if> : X — > Y is an embedding from (X, dx) to (Y, dy) if and only if it is an 
embedding from (X.Sx) to (Y, Sy). 

Proof. 

1. Suppose that ip: X — > Y is a non-expansive map from (X, dx) to (Y, dy). 
For any finite A C X, we have 5y(<fi(A)) — /i(conv(</>(A)). First note that 
conv((f)(A)) = (f>(conv(A)). Then note that since in this case the one-dimensional 
Hausdorff measure of a set is a limit of infima of the total length of countable 
covers of a set by geodesic segments [361 Section 6.1], n(4>(B)) < n(B) for all 
measurable B C X. This proves that tp is a non-expanding map with respect 
to diversities. The other direction is immediate. 

2. The argument follows as in 1, with showing fi(4>(B)) — /i(-B) for all measur- 
able b ex. a 



Theorem 5.9. Let (X, 6) be a phylogenetic diversity and let (X, d) be its induced 
metric. Let (T x , St) be the diversity tight span of (X, S) and let (T x , dx) be the 
metric tight span of (X,d). Then (T x ,dx) is isometric with the induced metric 
of(T 5 x ,S T ). 

Proof. 



By [H Theorem 8], (T x ,dx) is a real tree. Let (T x ,8 dT ) be the corresponding 
real-tree diversity, which is hyperconvex by Theorem 5.5 Let (T x ,ds) denote 



the induced metric of (T x ,5t). From Theorem 5.5 we have that (T Xl d$) is a 



complete real tree and is therefore a hyperconvex metric space [381 Theorem 
3.2]. 

Let Kd be the Kuratowski embedding from (X, d) to (T x , d-r) and let k$ be 
the Kuratowski embedding from (X, S) to (T x , 5t)- The map k$ is then also an 
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embedding between the induced metric (A, d) and the induced metric (T x , dg T ). 
Applying [33j Proposition 9.20(4))] in the category Met, there is an embedding 
<p : (T x ,d T ) -> [T x ,d ST ) such that 



k s = 4>o K d . 



(5.3) 



By Lemma 



5.8 



is also an embedding from the diversity (T x , Sd T ) to the 



diversity (T X ,S T ). For all A e Va^X), 

6(A) = 6 t (k s (A)) = S T (cl>(K d (A))) = 8 dT {K d {A)) 

so that K d embeds (X,S) in (T x ,8d T )- 

The identity map id T ^ on (T x , dx) is non-expansive and (j> is an embedding, 
so applying the definition of injective metric spaces to (T x ,dx) we have that 
there is a non-expansive map "0 from (T x ,d$ T ) to (T x ,dx) such that 



tp o = id T d 



(5.4) 



Applying Lemma |5.8| 1. we see that the map ifi is also a non-expansive map 
from (T X ,S T ) to (T x ,6 dT ). Combining ([K3) and we have 

lpOKg=lpO<fioK d 
= ld T d O Kd 



which is an embedding. By Lemma |3.7| we have that ip is an embedding, imply- 
ing that that is an isometry from (T x ,dx) to (T x ,d$ T ). □ 



6. Tight span and the Steiner tree problem 

Let A be a finite set of points in a metric space (M, d) . The ( metric ) Steiner 
tree problem is to find the shortest network that connects them. Clearly this 
network will always be a tree. More formally 

Metric Steiner Problem. 

Input: Subset A of a metric space (M, d). 
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Problem: Find a (graph theoretic) tree T for which X C V(T) C M and 

d(u,v) 

{u,v}£E(T) 

is minimised. 

Dress and Kriiger [32] examined an 'abstract' metric Steiner problem where 
one drops the constraint that V(T) C M. This abstract Steiner tree was one of 
the first distance-based criteria proposed for the inference of phylogenetic trees 
pM HO] , though it is now not widely used. Suppose that T is a tree with edge 
weights w : E(T) — ► 5R>o- Given u, v G V(T) we let w) denote the sum of 

edge weights along the path from u to v. 

Abstract Steiner Problem. 
Input: Finite metric space (X,d). 

Problem: Find a (graph theoretic) tree T and edge weighting w : E(T) — » ?R 
such that X C V(T), d w (x, y) > d(x, y) for all x, y £ X and 

e£B(T) 

is minimised. 

Suppose that T is a solution to the metric Steiner problem for X C Af . 
Define the weight function w : E(T) — > 5ft by w({u,w}) = d(u,v). Then, by 
the triangle inequality, d w (x,y) > d(x,y) for all x, y G X. It follows then that 
the length of the minimum abstract Steiner tree for (X, d\x) is a lower bound 
for the metric Steiner problem. Dress and Kriiger showed that the lower bound 
becomes tight when (M, d) equals (T x ,d T ), the metric tight span of X. 

Theorem 6.1 f |32j). Let (X,d) be a finite metric space. For every solution 
(T, w) to the abstract Steiner tree problem there is a map <f> : V(T) — > Tx 
such that 4>( x ) = k{x) for all x G X and w({u,v}) — dr(^>(u), (j>(v)) for all 
{u,v}(EE(T). 
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Hence the length of the minimal Steiner tree for k(X) in (Tx, g?t) equals the 
length of the minimal abstract Steiner tree for (X, d) and the minimal abstract 
Steiner trees can be embedded within the tight span. A direct corollary is that 
if d is tree-like then the abstract Steiner tree equals the tree corresponding to 
d. 

Here we show that, using diversities, we can obtain a tighter bound on the 
metric Steiner problem than that given by the abstract Steiner problem. Given 
a tree T with edge weights w and A C V(T) we let 5 W (A) be the sum of edge 
weights in the smallest subtree of T connecting A. Hence (X, 5 w \x) is a phylo- 
genctic diversity. 

Diversity Steiner Problem. 
Input: Finite diversity (X, 6). 

Problem: Find a (graph theoretic) tree T and edge weighting w : E(T) — > 5i 
such that X C V(T), S W (Y) > S(Y) for all Y C X, and 

eEE(T) 

is minimised. 

Let X be a finite subset of a metric space (M, d). For each A C X let £(A) 
denote the minimum length of a (metric) Steiner tree connecting the points A 
in the metric space (M,d). We see that (X,£) is a diversity. For each k > 2, 
consider the truncated diversity 5^ defined by 

S W (A) = max{£(B) : \B\ < k, B C A} 

for all A C X. 

Proposition 6.2. // (T, w) is a minimum length solution for the diversity 
Steiner problem applied to 5^ then the length X)ee-E(T) w ( e ) of T is a lower 
bound for £(X), the optimal length of a metric Steiner tree for X. 

Proof. 

Let (T',w') be a solution to the metric Steiner problem and let 8 W > be the 



35 



associated phylogenetic diversity. Then for all B such that \B\ < k we have that 
6 W ,(B), the length of T restricted to B, is bounded below by 1(B) = S^(B). 
It follows that <5 (fe) (A) < S W >(A) for all A C X, so that (T',u/) is a potential 
solution for the diversity Steiner problem. As (T, w) is optimal, we have 

J2 w(e)< ™'{e)=i(X). 

eSB(T) eGB(T') 

□ 

For fc = 2, the bounds provided by the Proposition |6.2| coincide with those 
given by length of the minimum abstract Steiner tree. As k increases, the 
bounds returned by the diversity Steiner tree applied to 5^ will tighten, until 
eventually the diversity Steiner tree will coincide with the metric Steiner tree. 
Furthermore, we have a direct extension of Theorem |6.1| stating that these 
diversity Steiner trees will all be contained in the diversity tight span. 

Theorem 6.3. Let (X, S) be a finite diversity. For every solution (T, w) to 
the diversity Steiner tree problem for (X, 5) there is a map 4> '■ V(T) — > Tx 
such that 4>( x ) — k(x) for all x € X and w({u, v}) = 5t({4>(u), <fi(v)}) for all 
{u,v}eE(T). 

Proof. 

Let 8 W be the diversity on V(T) given by (T, w), as defined above. Since (T, w) 
solves the diversity Steiner problem, 5 W (A) > 5(A) for all A C X. Let n denote 
the canonical embedding from X to Tx- Then n is a non-expansive map from 
(X,5 W \ X ) to (T X ,S T ). 

The tight span (Tx, St) is injective. Hence there is a non-expansive map <f> 
from (V(T),S W ) to (Tx,St) such that <j>(x) = k(x) for all x € X. For each u,v 
let w'({u,v}) = 5 T ({(f>(u),<t>(v)}). Then 

w({u,v}) = S w ({u,v}) > S T ({<f)(u),(t)(v)}) = w'({u,v}) 

for all V. 
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Consider A C X, and let Ea be the set of edges in the smallest subtree of 
T containing A. By the triangle inequality, 

5 W ,{A)= w'(e)>6(X). 

eeE A 

Hence (T, w') is a candidate for the diversity Steiner problem, but since (T, w) is 
already minimum, J2 e eE(T) w ( e ) — ^2 e eE(T) w '( e )- It follows that w(e) = w'(e) 
for all e G £7(T). □ 
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